    05051 Abstracts Collection -- Probabilistic, Logical and Relational Learning - Towards a Synthesis

    From 30.01.05 to 04.02.05, the Dagstuhl Seminar 05051 "Probabilistic, Logical and Relational Learning - Towards a Synthesis" was held in the International Conference and Research Center (IBFI), Schloss Dagstuhl. During the seminar, several participants presented their current research, and ongoing work and open problems were discussed. Abstracts of the presentations given during the seminar as well as abstracts of seminar results and ideas are put together in this paper. The first section describes the seminar topics and goals in general. Links to extended abstracts or full papers are provided, if available.

    Reinforcement learning in scheduling

    The goal of this research is to apply reinforcement learning methods to real-world problems like scheduling. In this preliminary paper, we show that learning to solve scheduling problems such as Space Shuttle Payload Processing and Automatic Guided Vehicle (AGV) scheduling can be usefully studied in the reinforcement learning framework. We discuss some of the special challenges posed by the scheduling domain to these methods and propose some possible solutions we plan to implement.
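
    The abstract frames scheduling as a reinforcement learning problem. As a minimal sketch of what that framing can look like, the tabular Q-learning update below treats a dispatch decision (which job or vehicle to handle next) as an action; the state encoding, reward, and hyperparameters are assumptions for illustration, not the paper's formulation.

        import random
        from collections import defaultdict

        ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1   # assumed learning rate, discount, exploration rate
        Q = defaultdict(float)                   # Q[(state, action)] -> estimated return

        def choose_action(state, actions):
            # epsilon-greedy choice among the dispatch decisions available in this state
            if random.random() < EPSILON:
                return random.choice(actions)
            return max(actions, key=lambda a: Q[(state, a)])

        def q_update(state, action, reward, next_state, next_actions):
            # standard one-step Q-learning backup toward reward plus discounted best next value
            best_next = max((Q[(next_state, a)] for a in next_actions), default=0.0)
            Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])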

    Prune-and-Score: Learning for Greedy Coreference Resolution

    We propose a novel search-based approach for greedy coreference resolution, where the mentions are processed in order and added to previous coreference clusters. Our method is distinguished by the use of two functions to make each coreference decision: a pruning function that prunes bad coreference decisions from further consideration, and a scoring function that then selects the best among the remaining decisions. Our framework reduces learning of these functions to rank learning, which helps leverage powerful off-the-shelf rank-learners. We show that our Prune-and-Score approach is superior to using a single scoring function to make both decisions and outperforms several state-of-the-art approaches on multiple benchmark corpora including OntoNotes.
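
    As a rough sketch of the two-stage decision the abstract describes, the snippet below first prunes the candidate coreference decisions for a mention and then scores the survivors; the function names, the fixed cutoff, and the "new cluster" sentinel are assumptions for exposition, not the paper's interface.

        def resolve_mention(mention, clusters, prune_scorer, rank_scorer, keep_k=5):
            # candidate decisions: merge the mention into an existing cluster, or start a new one
            candidates = list(clusters) + ["NEW_CLUSTER"]
            # pruning function: drop clearly bad decisions, keeping only the top-k candidates
            kept = sorted(candidates, key=lambda c: prune_scorer(mention, c), reverse=True)[:keep_k]
            # scoring function: pick the best decision among the remaining candidates
            return max(kept, key=lambda c: rank_scorer(mention, c))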

    Overfitting and Undercomputing in Machine Learning

    …suggests a reasonable line of research: find algorithms that can search the hypothesis class better. Hence, there has been extensive research in applying second-order methods to fit neural networks and in conducting much more thorough searches in learning decision trees and rule sets. Ironically, when these algorithms were tested on real datasets, it was found that their performance was often worse than simple gradient descent or greedy search [3, 5]. In short: it appears to be better not to optimize! One of the other important trends in machine learning research has been the establishment and nurturing of connections between various previously-disparate fields including computational learning theory, connectionist learning, symbolic learning, and statistics. The connection to statistics was crucial in resolving this paradox. The key problem arises from the structure of the machine learning task. A learning algorithm is trained on a set of training data, but then it is applied to make…

    Applying the weak learning framework to understand and improve C4.5

    There has long been a chasm between theoretical models of machine learning and practical machine learning…

    Machine learning


    Performance Environment Knowledge Learning

    The field of machine learning has made major strides over the last 20 years. This document summarizes the major problem formulations that the discipline has studied, then reviews three tasks in cognitive networking and briefly discusses how aspects of those tasks fit these formulations. After this, it discusses challenges for machine learning research raised by Knowledge Plane applications and closes with proposals for the evaluation of learning systems developed for these problems.

    State Aggregation in Monte Carlo Tree Search

    Monte Carlo tree search (MCTS) algorithms are a popular approach to online decision-making in Markov decision processes (MDPs). These algorithms can, however, perform poorly in MDPs with high stochastic branching factors. In this paper, we study state aggregation as a way of reducing stochastic branching in tree search. Prior work has studied formal properties of MDP state aggregation in the context of dynamic programming and reinforcement learning, but little attention has been paid to state aggregation in MCTS. Our main result is a performance loss bound for a class of value-function-based state aggregation criteria in expectimax search trees. We also consider how to construct MCTS algorithms that operate in the abstract state space but require a simulator of the ground dynamics only. We find that trajectory sampling algorithms like UCT can be adapted easily, but that sparse sampling algorithms present difficulties. As a proof of concept, we experimentally confirm that state aggregation can improve the finite-sample performance of UCT.
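
    To make the idea of "operating in the abstract state space while requiring only a ground simulator" concrete, here is a minimal UCT-style rollout in which node statistics are keyed by an aggregation map phi over ground states, while the simulator is still stepped on ground states. The map phi, the simulator signature, and all constants are assumptions for illustration, not the paper's algorithm.

        import math
        from collections import defaultdict

        N = defaultdict(int)     # visit counts, keyed by (abstract_state, action)
        Q = defaultdict(float)   # mean returns, keyed by (abstract_state, action)

        def ucb_action(abs_state, actions, c=1.4):
            # UCB1 selection over the actions at an abstract node
            total = sum(N[(abs_state, a)] for a in actions) + 1
            def score(a):
                n = N[(abs_state, a)]
                return float("inf") if n == 0 else Q[(abs_state, a)] + c * math.sqrt(math.log(total) / n)
            return max(actions, key=score)

        def rollout(ground_state, phi, simulator, actions, depth, gamma=0.95):
            # phi maps a ground state to its aggregate state; the simulator sees ground states only
            if depth == 0:
                return 0.0
            s_abs = phi(ground_state)
            a = ucb_action(s_abs, actions)
            next_ground, reward = simulator(ground_state, a)
            ret = reward + gamma * rollout(next_ground, phi, simulator, actions, depth - 1, gamma)
            N[(s_abs, a)] += 1
            Q[(s_abs, a)] += (ret - Q[(s_abs, a)]) / N[(s_abs, a)]
            return ret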

    Three New Algorithms to Solve N-POMDPs

    In many fields in computational sustainability, applications of POMDPs are inhibited by the complexity of the optimal solution. One way of delivering simple solutions is to represent the policy with a small number of alpha-vectors. We would like to find the best possible policy that can be expressed using a fixed number N of alpha-vectors. We call this the N-POMDP problem. The existing solver alpha-min approximately solves finite-horizon POMDPs with a controllable number of alpha-vectors. However, alpha-min is a greedy algorithm without performance guarantees, and it is rather slow. This paper proposes three new algorithms, based on a general approach that we call alpha-min-2. These three algorithms are able to approximately solve N-POMDPs. Alpha-min-2-fast (heuristic) and alpha-min-2-p (with performance guarantees) are designed to complement an existing POMDP solver, while alpha-min-2-solve (heuristic) is a solver itself. Complexity results are provided for each of the algorithms, and they are tested on well-known benchmarks. These new algorithms will help users to interpret solutions to POMDP problems in computational sustainability.
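
    For readers less familiar with alpha-vector policies, the short sketch below shows what "a policy expressed with a fixed number N of alpha-vectors" means: each alpha-vector is a payoff vector over states tagged with an action, and the policy acts greedily on dot products with the current belief. The data layout is an assumption for illustration, not the representation used by the alpha-min algorithms.

        def policy_value_and_action(belief, alpha_vectors):
            # belief: probability distribution over states (one entry per state)
            # alpha_vectors: N (action, vector) pairs, each vector having one entry per state
            best_action, best_value = None, float("-inf")
            for action, vec in alpha_vectors:
                value = sum(p * v for p, v in zip(belief, vec))
                if value > best_value:
                    best_action, best_value = action, value
            return best_value, best_action

    With only N vectors to inspect, the policy stays compact enough to read off which action each region of belief space triggers, which is the interpretability motivation the abstract cites.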